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Abstract. Recognition by sperm lysin of the egg vitelline 
envelope may be one component in determining the spe- 
cies-specificity of fertilization in abalones. The amino acid 
sequences of lysin proteins of seven California abalone 
species were deduced from the cDNA sequences. This is 
the first extensive comparison of a gamete recognition 
protein from congeneric species. Each prelysin has a highly 
conserved signal peptide of 18 amino acids, followed by 
a mature sequence of 136-138 residues. Of 136 aligned 
positions, 68 have the same amino acid in all seven se- 
quences. The % identity relative to the red abalone lysin 
sequence is: white 90%, flat 83%, pinto 82%, pink 78%, 
black 71%, and green 65%. Hydropathy plots and a dis- 
tance tree of the seven lysins show that red, white, and 
flat lysins are more closely related to each other than to 
the lysins of the other four species. A hypervariable, spe- 
cies-specific, domain exists in all sequences between po- 
sitions 2-12. Amino acid replacements between any two 
lysins are mostly nonconservative. Analysis of the cDNA 
sequences shows the number of nonsynonymous substi- 
tutions (amino acid altering) exceeds the number of syn- 
onymous substitutions (silent) in 20 of the 21 pairwise 
comparisons of the seven sequences, indicating that pos- 
itive Darwinian selection must promote the divergence 
of lysin sequences. 


Introduction 


A striking feature of fertilization is the species specificity 
of sperm-egg interaction in mammals (O’Rand, 1988; 
Yanagimachi, 1988a, 1988b; Roldan and Yanagimachi, 
1989) and invertebrates (Giudice, 1973; Summers and 
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Hylander, 1975, 1976; Osanai and Kyozuka, 1982). 
Sperm-egg mixtures from the same species usually yield 
zygotes more efficiently than cross-species mixtures. Al- 
though cross-species hybrid zygotes can be obtained in 
mammals and invertebrates, the general observation is 
that much higher concentrations of sperm are needed in 
the insemination mixture to achieve fertilization. Blocks 
to cross-species fertilization can occur at four points in 
the process: induction of the sperm acrosome reaction by 
components of the egg surface, adhesion of sperm to the 
egg envelope, sperm penetration of the egg envelope, and 
fusion of sperm and egg cell membranes. In echinoderms, 
the greatest barrier to cross-species fertilization is the fail- 
ure of sperm to adhere to the egg vitelline envelope (Sum- 
mers and Hylander, 1975, 1976); in mammals it is the 
failure of sperm to adhere to and penetrate the egg zona 
pellucida (O’Rand, 1988; Yanagimachi, 1988a,b; Roldan 
and Yanagimachi, 1989). 

The divergence of gamete recognition proteins may be 
important in the establishment of barriers to cross-fertil- 
ization between populations. This may be an important 
factor in the speciation of marine invertebrates using ex- 
ternal fertilization. To learn how species-specific gamete 
recognition proteins have diverged during evolution, we 
have studied a protein from abalone sperm. Abalones are 
marine archeogastropods of the genus Haliotis. Approx- 
imately 70 extant species occur on coastlines of the world, 
eight of them on the Pacific Coast of North America. 
Although abalones are members of an ancient group of 
gastropods, the genus Haliotis is relatively recent, most 
fossils being from the Miocene (5-25 million years; Lind- 
berg, 1991). 

The abalone egg is contained within a glycoproteina- 
ceous vitelline envelope (VE) about 0.6 um in thickness 
(Lewis ef al., 1982). The spermatozoon possesses a rela- 
tively enormous acrosome granule (Lewis ef al., 1980; 
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Shiroya and Sakai, 1983) containing two abundant pro- 
teins of about Mr 18,000 and 16,000 (Lewis et al., 1982). 
During fertilization, the sperm attaches to the egg VE, 
the acrosome granule opens, and the two proteins are 
secreted. A hole in the VE, 3 um in diameter, is created 
in seconds, and the sperm passes through it to fuse with 
the egg (Lewis et al., 1982; Sakai ef al., 1982). Partial 
amino acid sequence analysis (41 residues of the NH>- 
terminus) shows that the Mr 18.000 protein is not a pre- 
cursor of the Mr 16,000 protein (Vacquier, unpubl.). 
When the purified Mr 16,000 acrosomal protein (sperm 
lysin) is added to eggs, the VE rapidly dissolves by a non- 
enzymatic mechanism: VE glycoproteins are not degraded 
and no new NH>-termini are formed (Haino-Fukushima, 
1974: Lewis ez al., 1982: Hoshi, 1985). As previously dis- 
cussed (Lewis ef al., 1982; Hong and Vacquier, 1986; 
Baginski et al., 1990; Vaequier et al., 1990), lysin may 
act by competing for hydrogen and hydrophobic bonds 
that hold the glycoproteinaceous fibers of the VE together. 

The cDNA for pink and red abalone sperm lysins had 
been previously cloned and sequenced (Vacquier et al., 
1990). To learn about the evolutionary divergence of 
sperm lysin in California abalones, the polymerase chain 
reaction was used to generate double stranded cDNA from 
five additional species. The analysis of the seven deduced 
amino acid sequences of lysin is the first extensive com- 
parison of a gamete recognition protein in congeneric 
species. We were surprised to find that the divergence of 
the lysin sequences is promoted by positive Darwinian 
selection. 


Materials and Methods 


The seven species of abalone used in this study were: 
laliotis corrugata (pink, M34389), H. cracherodii (black, 
M59971), H. fulgens (green, M59972), H. kamtschatkana 
(pinto, M59970). 71. rufescens (red. M34388). H. sorenseni 
(white, M59968), and H. wadallensis (flat, M59969). The 
GenBank cDNA sequence accession number follows the 
common name of each species. The testes of male abalone 
were removed and poly A+ RNA isolated as described 
(Chomczynski and Sacchi, 1987; Vacquier et al., 1990). 
Northern blot analysis with a full length red abalone lysin 
cDNA as the probe, revealed a single band of hybridization 
of approximately 660 nucleotides in all seven species 
(Vacquier et al, 1990; and unpubl.). Oligonucleotide 
primers were synthesized to the 5’ end of the previously 
reported red and pink cDNA sequences (primer 6: GAA- 
CAGATTACAAGATGAAGCTGT, the italicized ATG 
being the initiation codon), and to the complementary 
strand of the 3’ end of the sequence adjoining the poly A 
tail (primer 7; TAGTAAATCTATTTATTCTGGAAT, 
the italicized being the complement of the poly A signal 
sequence: Vacquier et al., 1990). 


Two to ten ug of poly A+ RNA were used for first 
strand synthesis (Frohman, 1990). The RNA was washed 
twice in 1 ml 80% ethanol, dried, and dissolved in 7 pl 
water containing 3 ul of primer 7 (30 pMol). The tube 
was heated to 95°C for 5 min and then placed on ice for 
10 min, followed by a 5-s centrifugation (a quick spin). 
A reaction mixture of 10 ul was added to the tube [the 
10 wl contained: 2.5 ul dNTP mix at 2 mAf of each nu- 
cleotide; 2.0 ul 10 X RTC buffer (Frohman, 1990); 1.0 ul 
human placental RNase inhibitor (Promega, Madison, 
Wisconsin); 2.0 ul MuLV reverse transcriptase (400 units): 
and 2.5 ul water]. This mixture of 20 ul was incubated 
for 1 h at 37°C. An additional 1 ul of MuLV reverse 
transcriptase was then added (200 units), and the incu- 
bation continued | h at 45°C. Following incubation, the 
sample was diluted with 2 ml 0.1 X TE (1 mM Tris, 0.1 
mA EDTA, pH 8.0) and concentrated to 50 ul with a 
Centricon-30 microconcentrator (Amicon Inc., Beverly, 
Massachusetts). The redilution in 2 ml 0.1 X TE and con- 
centration to 50-100 ul was done three times. Sixteen ul 
of this first strand cDNA product was used for the second 
strand synthesis. 

To each tube of 16 ul was added 33 ul of a mixture of 
5 ul PCR buffer (Frohman, 1990), 2.5 ul dimethylsulf- 
oxide, 5.0 ul dNTPs at 2 mM each nucleotide, 6.0 ul 
primer 7 in water (30 pMol), 6.0 ul primer 6 (30 pMol), 
and 8.5 ul water. The 49 ul volume (in a 0.5 ml tube) was 
heated to 95°C for 5 min and cooled slowly (1 h) to 50°C. 
After a quick spin, 0.5 ul of Taq polymerase was added 
(2.5 units, Perkin-Elmer-Cetus, Emeryville, California), 
the tube vortexed gently, and 50 ul mineral oil added. 
The tube was incubated for 15 min at 37°C followed by 
40 min at 72°C. Amplification of the lysin cDNA was 
accomplished in a temperature cycler by 40 cycles of 94°C 
for 1 min, 45°C for 1 min, and 72°C for 2 min. An ad- 
ditional 0.5 ul of the Taq enzyme was added after the first 
20 cycles. Following the last cycle, the temperature was 
held at 72°C for 15 min. The tubes were cooled to 23°C 
by a quick spin in a microfuge and 1.0 ul of Klenow 
fragment added, and the incubation continued for 30 min 
at 23°C. Agarose gel electrophoresis of the 50 ul reaction 
mixture showed the presence of one product of amplifi- 
cation of approximately 650 nucleotides. 

The amplified double stranded cDNA was purified ei- 
ther by three cycles of dilution in 2 ml of 0.1 X TE and 
concentration to 50-200 ul with a Centricon-30, or by 
separating the product by electrophoresis in 1% agarose 
in 0.5 X TAE (0.04 M Tris acetate, 0.001 AJ EDTA, pH 
8.0). The 650 base pair band was excised from the gel and 
the cDNA purified with Prep-A-Gene (Biorad Labora- 
tories, Richmond, California). The cDNA was quantitated 
by spectrophotometry and 1] yg aliquots stored in 100% 
ethanol at —20°C. For sequencing, | ug of DNA was 
washed twice in 1 ml 80% ethanol and dried. The DNA 
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Amino acid sequences of the seven abalone sperm lysins. Dots denote identity to the red lysin 


sequence, dashes are for alignment, and numbering refers to the red lysin sequence. The signal peptide spans 
positions —18 to —1. The single letter amino acid code is used. 


pellet was dissolved in 10 ul of sequenase buffer (U. S. 
Biochemicals, Cleveland, Ohio) containing 10 pMol of a 
sequencing primer. Eight different oligonucleotide primers 
were used for sequencing, all of them synthesized so as 
to correspond to the red abalone lysin cDNA sequence 
(Vacquier et al., 1990). The tube containing the mixture 
of DNA and primer was heated to 95°C for 5 min and 
snap frozen 5 min in a dry ice ethanol bath, then placed 
in an aluminum block precooled to —20°C, which was 
allowed to warm to 23°C over a 2-h period. Following a 
quick spin, the Sequenase protocol was performed and 
the sequences of both strands of cDNA determined a 
minimum of two times. The cDNA and amino acid se- 
quences were computer aligned and listed in order of sim- 
ilarity using the progressive alignment and tree building 
program given in Feng and Doolittle (1990). Hydropathy 
plots with a window of seven amino acids were done by 
the method of Kyte and Doolittle (1982). 

To determine whether amino acid replacement between 
any two lysin sequences conserved the class of residue, 
the 20 amino acids were divided into 5 classes following 


the structural considerations of Dickerson and Geis 
(1983). Synonymous and nonsynonymous nucleotide 
substitutions were computed by the methods of Li et al. 
(1985) and Nei and Gojobori (1986). 


Results 
Deduced amino acid sequences 


The deduced amino acid sequences, aligned and listed 
in order of descending similarity (Feng and Doolittle, 
1990), are presented in the single letter code in Figure 1. 
Assignments of the initiation methionine (M at position 
—18), the signal sequence of 18 amino acids, the NH>- 
terminal residue of the mature lysins being arginine (R 
at position 1), and the COOH-terminal residue being ly- 
sine (K at position 136) have been previously presented 
(Vacquier et al., 1990). In Figure |, dots denote identity 
to the red abalone lysin sequence and dashes are for align- 
ment. The signal sequences (positions —18 to —1) have 
been highly conserved during evolution and are typical 
of eukaryotes (von Heijne, 1985). Neither cysteines nor 
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sites for N-linked glycosylation are found in the mature 
lysins. The mature pink abalone lysin is 137 residues, 
black abalone lysin is 138, and the lysins of the five other 
species are 136 residues in length. Of 136 aligned positions, 
68 (50%) have the same amino acid in all seven species. 
The two longest regions of perfect identity are the eight 
residues between positions 88-95 and the 11 residues be- 
tween positions 52-62. There are four occurrences of two 
contiguous positively charged amino acids in all seven 
sequences (positions 47-48, 55-56, 71-72, and 94-95). 
The percent identity in amino acid residues for the 21 
pairwise comparisons of the seven lysin sequences (Table 
I) shows the decrease in similarity progressing from red 
lysin to green lysin. Green abalone lysin is equally dissim- 
ilar from each of the other six lysins, the percent identity 
varying from 63 to 65%. The region of greatest difference 
among all seven sequences is the | 1 residue segment com- 
prising positions 2-12 (Fig. 1). In this region, no two spe- 
cies have the identical sequence. Considerable difference 
in charge distribution is seen in this hypervariable seg- 
ment. For example, the red, flat, and pinto lysins have a 
net charge of +1, whereas pink abalone lysin has a net 
charge of +6. 


Hydropathy plots and branching order 


Hydropathy plots of the seven mature lysin sequences 
are of value in showing subtle differences throughout the 
sequences (Fig. 2). The plots of the hypervariable domain 
of positions 2-12 (shaded) are in most cases species-spe- 
cific. The upper three plots (red, white, flat) are quite sim- 
ilar, all having a large hydrophobic domain between res- 
idues 15-30. The pinto is clearly different from the top 
three, this large hydrophobic domain being reduced and 
followed by a hydrophilic domain centered at position 
30. The pink lysin has a hydrophilic domain centered at 
residue 60 that is more similar in shape to the one in the 
black and green lysins than it is to the other four species. 
There is a moderately hydrophobic domain at about po- 
sition 70 in pink lysin shared with only the black species. 
The pinto and green are the only two lysins having a large 


Table I 


Percent identity of amino acid residues in 21 pairwise comparisons 
in 136 aligned positions of seven lysins 








Species Red White Flat Pinto Pink Black 
White 90 

Flat 83 88 

Pinto 82 85 76 

Pink 78 80 Tit 72 

Black 71 72 T 65 78 

Green 65 64 65 63 65 65 


POSITION 
40 80 120 





Figure 2. Hydropathy plots of the seven abalone lysin sequences 
using a window of seven residues. Hydrophobic values are positive and 
hydrophilic values negative. The hypervariable domain of 2-12, and the 
two invariant domains of 52-62 and 88-95 are shaded. R, red; W, white; 
F, flat; Pt, pinto; Pk, pink; B, black; and G, green abalone lysins. 


hydrophobic peak close to position 100. The plots for the 
black and green are clearly distinct from the other species 
in having two large hydrophilic domains centered about 
positions 45 and 60. However, the plot of the black lysin 
shows a large hydrophilic domain around position 125 
which is not present in the green. The regions of !1 (po- 
sitions 52-62) and eight (positions 88-95) amino acids 
that are invariant in all seven sequences are shown as 
shaded zones (Fig. 2). Both these regions are amphipathic, 
being hydrophobic in the NH,-terminal direction and hy- 
drophilic in the COOH-terminal direction. 

A distance tree depicting the branching order of the 
seven lysin sequences (Fig. 3) shows red and white lysins 
to be the most closely related proteins. The black and 
green lysin sequences are the most divergent; they are far 
from the other five sequences and also far from each other. 


Armino acid replacements and nucleotide substitutions 


The 21 possible pairwise comparisons of the 7 lysin 
sequences were analyzed to determine the fraction of 
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Table II 


The replacement of amino acids in mature lysins is nonconservative 


Species Red White Flat Pinto Pink Black 
White Yi 

Flat hy "Ne 

Pinlo *hs Ya 1h 

Pink Fay Vag sy In 

Black a0 sg "sg a7 In 

Green Phe Sho Yyy Ysi a6 "Shag 


amino acid replacements that were between residues of 
the same amino acid class (conservative replacement). The 
data are shown in Table II, where the numerator is the 
number of replacements between amino acids of the same 
class, and the denominator is the total number of replace- 
ments in each pairwise comparison. In all but two com- 
parisons (flat X red and flat X white), conservative amino 
acid replacements are far below 50%. In summary, the 
majority of amino acid replacements between any two 
lysins involves switching the class of residue. 

The number of synonymous (Ds) and nonsynonymous 
(Dn) nucleotide substitutions per site were computed for 
the 21 pairwise comparisons of the seven lysin cDNA 
sequences (Nei and Gojobori, 1986). The data (Table III) 
show that in all but one comparison (flat X green) Dn is 


greater than Ds. In 6 of the 21 comparisons, the difference 
is significant at the 5% level, and in two at the 0.5% level. 
These data show that positive Darwinian selection is pro- 
moting the divergence of lysin sequences. Also, the closely 
related sequences (Fig. 3) of red, white, flat, and pinto 
abalone lysins exhibit the positive selection phenomenon 
more strongly than do the more widely divergent se- 
quences. 


Discussion 
Amino acid sequences 


Homology among the seven mature lysins is readily 
apparent (Fig. 1). The sequences align perfectly in 952 of 
955 amino acids. Lysins are constrained in length, varying 
from 136 to 138 amino acids. There is conservation of 
primary structure in that 68 of the 136 aligned positions 
have the identical amino acid in all seven sequences. With 
the exception of the hypervariable domain of positions 
2-12, these invariant 68 positions are spread throughout 
the lysin molecule with a slight concentration toward the 
central portion of the sequence. Of the 68 invariant po- 
sitions, 14 are occupied by residues that are highly con- 
served (Graur, 1985; single letter code, W = 3, G = 5, 
and Y = 6), and 24 by the group of seven amino acids 
that are replaced most frequently in mammalian proteins 
(Graur, 1985; T = 1,H=1,Q=2,F=3,1=4,M=5, 


Table If 


Percent synonymous (Ds) and non-synonymous (Dn) nucleotide substitutions per site 


Species Ds (SE) Dn 
Red X White 1.62 (1.32) 5.79 
Flat 2.61 (1.69) 10.08 
Pinto 2.76 (1.75) 10.71 
Pink 10.59 (3.53) 14.63 
Black 11.08 (3.64) 21.76 
Green AAN (5.31) 24.93 
While X Flat aS (2.16) 6.36 
Pinto 3.30 (1.91) 8.39 
Pink 9.92 (3.40) 13.69 
Black 11.42 (3.69) 20.82 
Green Die (5.27) 25.44 
Flat x Pinto 5.45 (2.48) 13.59 
Pink 11.67 (3.71) 15.86 
Black 13.42 (4.03) 22.70 
Green 25.02 (5.84) 23.78 
Pinto X Pink 9.76 (3.39) 18.05 
Black 13.71 (4.12) 26.06 
Green 22.66 (5.54) 24.49 
Pink X Black 9.31 (3.28) 13.12 
Green 16.03 (4.45) 24.02 
Black X Green 18.14 (4.80) 27.80 


* Significant at 5% level, ** at 0.5% level. 


Dn d= 
(SE) Ds Dn — Ds SE 
(1.40) 3.57 4.17* 1.92 
(1.89) 3.86 TAT 2.54 
(1.95) 3.88 TOSI 2.62 
(2.33) 1.38 4.04 4.23 
(2.95) 1.96 10.68* 4.69 
(3.22) 1.17 3.66 6.21 
(1.47) 1.50 2a 2.61 
(1.70) 2.54 5.09* 2.56 
(2.25) 1.38 Beil 4.08 
(2.87) 1.82 9.40* 4.67 
(3.26) 1.20 4.32 6.20 
(2.23) 2.49 8.14* 3.34 
(2.45) 1.36 4.19 4.45 
(3.04) 1.69 9.28 5.05 
(3.13) 0.95 —1.24 6.63 
(2.63) 1.85 8.29 4.29 
(3.31) 1.90 12358 5.28 
(3.18) 1.08 1.83 6.39 
(2.19) 1.41 3.81 3.94 
(3.15) 1.50 7.99 5.45 
(3.46) 1.53 9.66 5.92 
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and L = 8). Because the occupancy of these 68 positions 
is identical in all 7 lysins, we conclude that they are crucial 
to lysin’s role in fertilization in California abalones. 

The hypervariable domain (positions 2-12) is strikingly 
similar to the ligand binding domain of annexin IT (Becker 
et al., 1990), a membrane and lipid binding protein. An- 
nexin I] possesses an NH>-terminal 12 amino acid segment 
(Ac-STVHEILCKLSL) that binds its ligand (pI 1). Ligand 
binding induces the 12 residues to form a positively 
charged amphipathic «-helix that becomes buried in pI 1. 
The important structural features for binding between 
annexin II and pl l are the hydrophilic residue in position 
I and the hydrophobic side chains at positions 3, 6, 7, 
and 10. White abalone lysin has residues with hydrophobic 
side chains at positions 3, 6, 7, and 10, and the same 
positively charged residues at positions 4 and 9 (His and 
Lys) as has annexin Il. In red, pink, and pinto lysins, 3 
out of 4 residues at positions 3, 6, 7, and 10 have hydro- 
phobic side chains. Three of the seven lysins are positively 
charged at position 4 and six out of seven at position 9. 
The binding of lysin to its unknown VE ligand may thus 
be similar to the binding of annexin II to pl] (Becker et 
al., 1990). 

Much has been learned about protein-protein recog- 
nition by X-ray crystallographic studies of the binding of 
proteases with their inhibitor proteins, and the binding 
of antibody to antigen (Janin and Chothia, 1990). In the 
protease-inhibitor complexes, 10-15 residues of the in- 
hibitors make contact with 17-29 residues of the proteases. 
These numbers are consistent with the size of the lysin 
hypervariable domain. In antibodies, the antigen binding 
sites are disproportionately rich in residues with aromatic 
side chains. In the seven lysins, between positions 2-12, 
26 residues of a total of 77 (34%) in all 7 sequences have 
aromatic side chains, whereas by total amino acid com- 
position, only 17% of lysin residues are aromatic. This 
adds support to the concept that positions 2-12 in lysin 
may be involved in the binding of its VE ligand. We have 
not as yet quantitatively determined the ability of lysin 
to dissolve egg VEs in all 21 pairwise combinations of the 
7 species. However, we have demonstrated species spec- 
ificity in the cross combinations of red and pink abalone 
lysins and egg VEs proteins (Vacquier et al., 1990). 


Positive Darwinian selection in lysin divergence 


In most cases, when two orthologous proteins are di- 
verging, the frequency of synonymous (silent) nucleotide 
substitution (Ds) will be greater than that of nonsynon- 
ymous (amino acid altering) substitution (Dn). If positive 
Darwinian selection is promoting divergence of two pro- 
teins, the converse will be true. In positive selection there 
is adaptive value to alter the amino acid sequence. Positive 
selection has been proven at the molecular level in the 





following cases: the class I (Hughes and Nei, 1988, Hughes 
et al., 1990) and class I (Hughes and Nei, 1989) major 
histocompatibility complex antigens; the Vy genes of im- 
munoglobulins (Tanaka and Nei, 1989); the circumspo- 
rozoite antigen in Plasmodium (Hughes, 1991); human 
influenza A virus (Fitch et al., 1991); and the Adh locus 
in Drosophila (McDonald and Kreitman, 1991). The 
nonconservative nature of amino acid replacements be- 
tween lysins (Table II) provided the clue that positive se- 
lection might be promoting lysin divergence. Analysis of 
lysin CDNA sequences by the method of Nei and Gojobori 
(1986; Table III) shows that Dn exceeds Ds in 20 of 21 
pairwise comparisons. Among the closely related se- 
quences of the red, white, flat, and pinto abalone lysins 
(Fig. 3), Dn shows statistically significant higher values 
than Ds. Analysis by a similar method (Li ef al., 1985) 
yielded almost the same results. For example, in com- 
parisons of red, white, flat, and pinto lysin sequences, the 
average nonsynonymous value was 2.4 times greater than 
the synonymous value. The most extreme comparison 
was between the red and flat sequences, where the non- 
synonymous value was 3.4 times greater than the syn- 
onymous value. These data indicate a strong selective ad- 
vantage in altering the amino acid sequence of lysin. This 
is the first example of positive selection acting on a gamete 
recognition protein. With the exception of the Adh locus 
in Drosophila, the common attribute abalone lysins share 


PINTO 


Distance tree showing the branching relationships of the 


Figure 3. 
lysin sequences. The root of the Iree is placed arbitrarily al the midpoint. 
The numbers on the branches represent relalive evolutionary distances 
(Feng and Doolittle, 1990). 
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with the published examples of positive selection is the 
involvement in extracellular recognition. 

We cannot speculate about what could provide the se- 
lective pressure acting on lysin divergence. The demon- 
stration of positive selection does not prove that it is a 
causative factor responsible for speciation in abalones. 
Experimental evidence exists that abalone embryos tend 
to settle near their parents (Prince et al., 1987), and that 
genetic structure can occur within an abalone species in 
two populations separated by three km (Brown, 1991). 
Thus, speciation by geographic isolation probably occurs 
in abalones. Although the demonstration of positive se- 
lection in lysin divergence does not indicate how abalone 
populations split into distinct species, the possibility that 
it may accompany the speciation process should be con- 
sidered. The statistically significant data showing positive 
selection (Table HI) are between the closely related abalone 
species (Fig. 3). This suggests that a high frequency of 
nonsynonymous substitution (Dn) accompanies initial 
divergence, but Dn decreases as divergence increases. A 
similar situation occurs with the class | major histocom- 
patibility genes in which Dn is greater in intralocus as 
compared to interlocus comparisons (Hughes and Nei, 
1988). Thus, the reason that the positive selection data 
set is so robust for lysin may be due to the relatively recent 
appearance of these closely related species in the fossil 
record (20 million years ago; Lindberg, 1991). 

One might speculate that positive selection may cause 
allelic variation in abalone sperm lysins. However, two 
male pink abalones from San Diego, California and six 
male red abalones (two from San Nicholas Island, two 
from Mendocino, California, and two from San Diego) 
yielded identical species-specific cDNA sequences in both 
the 462 nucleotide open reading frame and in about 150 
nucleotides of the 3’ untranslated region containing the 
poly A signal sequence (Vacquier ef al., 1990). San Diego 
and Mendocino are separated by roughly 800 km of 
coastline, and by Point Conception, an important eco- 
logical barrier to larval transport. We tentatively conclude 
from these limited numbers of individuals that there is 
no major allelic variation in lysin sequences in the red 
abalone. The species-specific lysin sequences may thus be 
well fixed in the extant California species. As pointed out 
by a reviewer, there are currently no models to explain 
these data; they represent a genuine mystery for future 
research to solve. 

In the class | major histocompatibility antigens, the 
antigen recognition site exhibits positive selection, but the 
different alleles share many conserved structural features 
making their homology obvious over tens of millions of 
years of evolution (Hughes and Nei, 1988). Abalone sperm 
lysins are similar in that strong homology exists among 
all seven ltysins, yet 50% of the positions have species- 
specific amino acid replacements. Knowing the sequences 


of these seven sperm lysins begs the question as to the 
nature of sequence variation in the VE ligands that are 
the lysin “receptors” of the egg surface. We predict that 
these ligands will show the same pattern of variation; that 
is, some regions will be conserved in all species, while 
others will be hypervariable and species-specific. 


Acknowledgments 


We thank J. W. Swanson, R. McConnaughey, Dr. D. L. 
Leighton, and Dr. J. R. Pawlik for abalones, Dr. Wen- 
Hsiung Li for his computer program, Drs. Masatoshi Nei 
and Tatsuya Ota for their interest and considerable effort 
in preparing the data presented as Table 11I. Discussions 
with Drs. D. W. Smith, D.-F. Feng, R. F. Doo- 
little, W. M. Fitch, J. E. Minor, R. J. Britten, S. R. Palumbi, 
D. A. Powers, and F. Azam are gratefully acknowledged. 
Supported by NIH Grant HD12986 to V.D.V. and by a 
Korean Government Overseas Scholarship to Y.-H.L. 
This paper is dedicated to the memory of Professor Alberto 
Monroy. 


Literature Cited 


Baginsky, M. L., C. D. Stout, and V. D. Vacqnier. 1990. Diffraction 
quality crystals of lysin from spermatozoa of the red abalone (/faliotis 
rufescens). J. Biol. Chem. 265: 4958-4961. 

Becker, T., K. Weber, and N. Johnson, 1990. Protein-protein recog- 
nition via short amphiphilic helices; a mutational analysis of the 
binding site of annexin 1] for p11. EMBO J. 9: 4207-4213. 

Brown, L. D. 1991. Genetic variation and population structure in the 
blacklip abalone, Haliotis rubra. Aust. J. Mar. Freshwater Res. 42: 
77-90. 

Chomczynski, P., and N. Sacchi. 1987. Single-step method of RNA 
isolation by acid guanidinium thiocyanate-phenol-chloroform ex- 
traction. Anal. Biochem. 162: 156-159. 

Dickerson, R. E., and I. Geis. 1983. Hemoglobin: structure, function, 
evolution and pathology. Benjamin/Cummings Inc. Menlo Park, CA. 

Feng, D.-F., and R. F. Doolittle. 1990. Progressive alignment and phy- 
logenetic tree construction of protein sequences. Meth. Enzymol. 183: 
375-387. 

Fitch, W. M., J. M. E. Leiter, X. Li, and P. Palese. 1991. Positive 
Darwinian evolution in human influenza A viruses. Proc. Natl. Acad. 
Sci. USA 88: 4270-4274. 

Frohman, M. A. 1990. RACE: rapid amplification of cDNA ends. Pp. 
28-38 in PCR Protocols, M. A. Innis, D. H. Gelfand, J. J. Sninsky 
and T. J. White, eds. Academic Press, San Diego. 

Giudice, G. 1973. Pp. 162-174 in Developmental Biology of the Sea 
Urchin Embryo. Academic Press, San Diego. 

Graur, D. 1985. Amino acid composition and the evolutionary rates 
of protein-coding genes. J. AFol. Evol 22: 53-62. 

Haino-Fukushima, K. 1974. Studies on the egg membrane lysin of Te- 
gula pfeifferi: the reaction mechanism of the egg membrane lysin. 
Biochim. Biophys. Acta 352: 179-191. 

Hong, K., and V. D. Vacquier 1986. Fusion of liposomes induced by 
a cationic protein from the acrosome granule of abalone spermatozoa. 
Biochemistry 25: 543-550. 

Hoshi, M. 1985. Sperm lysins. Pp. 431-462 in Biology of Fertilization, 
C. B. Metz and A. Monroy, eds. Academic Press, San Diego. 

Hughes, A. L. 1991. Circumsporozoite protein genes of malaria par- 
asites (Plasmodium spp.): evidence for positive selection on immu- 
nogenic regions. Genetics 127: 345-353. 


104 Y.-H. LEE AND V. D. VACQUIER 


Hughes, A. L., and M. Nei. 1988. Pattern of nucleotide substitution 
at major histocompatibility complex loci reveals overdominant se- 
lection. Nature 335: 167-170. 

Hughes, A. L., and M. Nei. 1989. Nucleotide substitution at major 
histocompatibility complex class 1] loci: evidence for overdominant 
selection. Proc. Nail. Acad. Sci. USA 86: 958-962. 

Hughes, A. L., T. Ota, and M. Nei. 1990. Positive Darwinian selection 
promotes charge profile diversity in the antigen binding cleft of class 
1 MHC molecules. Mol. Biol. Evol. 7: 515-524. 

Janin, J., and C. Chothia. 1990. The structure of protein-protein rec- 
ognition sites. J. Biol. Chem. 265: 16027-16030. 

Kyte, J., and R. F. Doolittle. 1982. A simple method for displaying 
the hydropathic character of a protein. J. Mol. Biol. 157: 105-132. 

Lewis, C. A., D. L. Leighton, and V. D. Vacquier. 1980. Morphology 
of abalone spermatozoa before and after the acrosome reaction. J. 
Ultrastruct. Res. 72: 39-47. 

Lewis, C. A., C. F. Talbot, and V. D. Vacquier. 1982. A protein from 
abalone sperm dissolves the egg vitelline layer by a nonenzymatic 
mechanism. Dev. Biol. 92: 227-240. 

Li, W.-11., C.-f. Wu, and C.-C. Luo. 1985. A new method for estimating 
synonymous and nonsynonymous rates of nucleotide substitution 
considering the relative likelihood of nucleotide and codon changes. 
Mol. Biol. Evol. 2: 150-174. 

Lindberg, D. R. 1991, Evolution, distribution and systematics of Hal- 
iotidae. In Abalone of the World: Biology, Fisheries and Culture, A. 
Shepherd, M. J. Tegner, and S. A. Guzman Del Proo, eds. Blackwells 
Scientific Publishers, London. 

McDonald, J. H., and M. Kreitman. 1991. Adaptive protein evolution 
at the Adh locus in Drosophila. Nature 351: 652-654. 

Nei, M., and T. Gojobori. 1986. Simple methods for estimating the 
numbers of synonymous and nonsynonomous nucleotide substitu- 
tions. Mol. Biol. Evol. 3: 418-426. 

O’Rand, M. G. 1988. Sperm-egg recognition and barners to interspecies 
fertilization. Gamete Res. 19: 315-328. 


Osanai, K., and K. Kyozuka. 1982. Cross fertilization between sea urchin 
eggs and oyster spermatozoa. Gamete Res. 5: 49-60. 

Prince, J. D., T. L. Sellers, W. B. Ford, and S. R. Talbot. 1987. Ex- 
perimental evidence for limited dispersal of haliotid larvae (genus 
Haliotis; Mollusca: Gastropoda). J. Exp. Mar. Biol. Ecol. 106: 
243-263. 

Roldan, E. R. S., and R. Yanagimachi. 1989. Cross-fertilization between 
Syrian and Chinese hamsters. J. Exp. Zool. 250: 321-328. 

Sakai, Y. T., Y. Shiroya, and K. Haino-Fukushima. 1982. Fine struc- 
tural changes in the acrosome reaction of the Japanese abalone, Hal- 
iotis discus. Dev. Growth Differ. 24: 531-542. 

Shiroya, Y., and Y. T. Sakai. 1983. Fine structure of the spermatozoon 
in the Japanese abalone, Haliotis discus. J. Wayo Women's Univ. 
24: 253-267. 

Summers, R. G., and B. L. Hylander. 1975. Species-specificity of ac- 
rosomal reaction and primary gamete binding in echinoids. Exp. 
Cell Res. 96: 63-68. 

Summers, R. G., and B. L. Hylander. 1976. Primary gamete binding: 
the species-exclusive event of echinoid fertilization. Exp. Cell Res. 
100: 190-194. 

Tanaka, T., and M. Nei. 1989. Positive Darwinian selection observed 
at the variable region genes of immunoglobins. Mol. Biol. Evol. 6: 
447-459. 

Vacquier, V. D., K. R. Carner, and C. D. Stout. 1990. Species-specific 
sequences of abalone lysin, the sperm protein that creates a hole in 
the egg envelope. Proc. Natl. Acad. Sci. USA 87: 5792-5796. 

von Heijne, G. 1985. Signal sequences: the limits of variation. J. Mol. 
Biol. 184: 99-105. 

Yanagimachi, R. 1988a. Mammalian fertilization. Pp. 135-185 in The 
Physiology of Reproduction, E. Knobil and J. Neill, eds. Raven Press, 
New York. 

Yanagimachi, R. 1988b. 
port 32: 3-43. 


Sperm-egg fusion. Curr. Top. Membr. Trans- 


